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ABSTRACT 



Context. Sources generating most of the X-ray background (XRB) are dispersed over a wide range of redshifts. Thus, statistical char- 
acteristics of the source distribution carry the information on the matter distribution on very large scales. 
Aims. We test the possibility to detect the variation of the X-ray source number counts over the celestial sphere. 
Methods. A large number of Chandra pointings spread over both galactic hemispheres is investigated. A search for all the point-like 
sources in the soft band of 0.5 - 2 keV is performed, and statistical assessment of the population of sources below the detection thresh- 
old is carried out. A homogeneous sample of the number counts at fluxes above ~ 10"'* erg s^'cm"- for more than 300 ACIS fields 
was constructed. The counts correlations between overlapping fields were used to assess the accuracy of the computational methods 
used in the analysis. 

Results. It is shown that the source number counts vary between fields at the level only slightly larger than the fluctuation amplitude 
expected for the random (Poissonian) distribution. Nevertheless, small asymmetry between galactic hemispheres is present. The aver- 
age number of sources in the northern hemisphere is larger than in the southern at the 2.75 cr level. Also the autocorrelation function 
of the source density in both hemispheres are substantially different. 

Conclusions. Possible explanations for the observed anisotropies are considered. If the effect is unrelated to the observational se- 
lection, a large scale inhomogeneities in the distribution of X-ray sources are required. Correlations of the source number counts 
observed in the southern hemisphere could be generated by a coherent structure extending over ~ 1200 Mpc. 

Key words. X-rays: diffuse background - X-rays: general - large-scale structure of Universe 



1. Introduction 

The largest structures identified in galaxy redshift surveys are lo- 
cated at distances comparable to the maximum distance at which 
structures can be effectively distinguished. In the CfA redshift 
survey b eyond the Great Wall n ot much structure is at all recog- 
nizable (iGeller & Huchralll989l) . In the SDSSQ the large galaxy 
filaments are discernible up to roughly 500 Mpc, and the mos t 
prominent feature is the Sloan Great Wall (iGott et all l2005b . 
found at a distance of ~ 300 Mpc. 

Larger structures potent ially could be isolate d in the quasar 
part of the SDSS. Recently Clo wes et al.l (12012 *) report a group 
of 73 quasars that span redshifts 1.17 - 1.37. A total length of 
this twisted filament exceeds 1200 Mpc. Albeit Park et al.l(l2012h 
have shown that largest structures found in the galaxy distribu- 
tion are consistent with the canonical ACDM model, it is plau- 
sible that the quasar alignment could be discordant with this 
model. 

It appears that the question of the large structures could be 
addressed using the X-ray surveys. Most of extragalactic X-ray 
sources are associated with various types of Active Galactic 
Nuclei (AGN). Because of a strong cosmic evolution and a 
wide luminosity function, the AGN observed in the selected 
flux range are distributed within a very wide range of redshifts. 
Equivalently, the observed source flux is only weakly corre- 
lated with the source distance. Roughly 90 percent of the X- 
ray background in the 0.5 - 2keV energy band is generated 



by sources brighter than 3 ■ 10"'^ erg s 'cm^^ (e.g. lMoretti et al.l 
2003). A majority of these sources are located at cosmologica l 
distances: 80 percent in the redshift range 0.3-2.2 (ISohanl2008h . 
The comoving radial distance between redshifts 0.3 and 2.2 ex- 
ceeds 4100 MpcB Thus, sources detected in a moderately deep 
Chandra observation are tracers of the matter distribution at Gpc 
scales. 

In the deepest Chandra fields of 2 and 4 Ms exposure sev- 
eral hundred sources are detected. This allows to investigate the 
source distribution along the line of sight, provided the redshifts 
of objects are known. However, to investigate large scale mat- 
ter distribution, a transverse dimension of the survey should be 
of a comparable size as the radial one. In the present paper we 
explore the possibility to analyze the giant structures in the ob- 
servable Universe using a large number of the Chandra medium 
depth observations scattered over the whole sky. Although a sin- 
gle pointing represents a pencil-like cut through space, a set of 
several hundred pointings to some extent act as a uniform all-sky 
survey. The main objective was to select from the Chandra Data 
Archive all the pointings suitable for source detection, and then 
to construct an efficient method to count the sources in all the 
fields in a homogeneous way. Number of sources in each field 
was used to estimate the amplitude of the source counts, N{S ), 
at a fixed flux range around 10 ergs^'cm"^. A power law 
parametrization of the N{S ) relationship was applied. Statistical 
characteristics of the N{S) amplitude variations are examined. 



Send offprint requests to: A. M. Soltan 
' http://www.sdss.org/ 



Assuming "737" ACDM cosmology, i.e. H„ 
= 0.3, and SIa = 0.7. 
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Fig. 1. The distribution of 80 pointings in the northern galactic 
hemisphere. Two circles represent galactic latitudes b = 30° and 
b - 60°. Each observation covers a circle of 10 arcmin in diam- 
eter and the symbol sizes are not to scale. Labels close to some 
points in the map denote the actual number of pointings in the 
area. All the pointings are detached. Full dots indicate pointings 
with high statistical weights (see Sect.|4|. 

Results, albeit preliminary, demonstrate strong and weak points 
of the method. The volume of the archive Chandra observa- 
tions is still too small for a comprehensive study. Nevertheless, 
the presently available data reveal the potential ability of X-ray 
source counts in the investigation of the large scale matter distri- 
bution. 

The organization of the paper is as follows. In the next 
section, selection and preparation of the observational data ex- 
tracted from the Chandra archives is described. In Sect. |3] pro- 
cedures to estimate the amplitude of the number source counts, 
A^(5' ), are presented. Two independent methods to determine the 
counts in two separate flux levels are used. Results of these cal- 
culations, as well as statistical and (possible) systematic eiTors 
are discussed in Sects. |4] and |5] The main conclusion are sum- 
marized in Sect.|6] 

2. Selection of observational material 

The Chandra Data Archive has been searched for all the ACIS- 
I observations at high galactic latitudes with the exposure time 
above 30 ks. Originally, a selection limit of \b\ > 30° has been 
applied for both hemispheres. Then, because of a small number 
of adequate pointings, a search for more observations in a belt of 
25° < b < 30° in the southern galactic hemisphere has been per- 
formed. Eventually, within this extension only 4 pointings with 
27° < b < 30° satisfied all the selection criteria (see below) and 
were included into the final set of data. 

All the "interesting" observations were handled through 
the standard data processing pipelines at the Chandra X-ray 
Centefl Each observation was carefully inspected in terms of 

^ See http ://asc . harvar d.edu/ciao | 



270' 




180" 



90" 

SoutherTi galactic ticriiispi:!cre 



Fig. 2. Same as Fig [T] for 41 pointings in the southern galactic 
hemisphere. 



usefulness for the present investigation. Pointings centered at 
galactic clusters, nearby galaxies (e.g. Magellanic Clouds or 
NGC galaxies) have been discarded. Also pointings with dom- 
inating extended source have been removed. If the extended 
source occupied a relatively small fraction of the field of view, a 
section with the source was cut-off, and the remaining data un- 
affected by the extended emission have been used. Fields with 
extremely strong point sources have been treated in a similar 
way, i.e. only areas affected by linear smearing generated dur- 
ing a readout have been removed. All the exposures were exam- 
ined for the presence of the background flares and only "good 
time intervals" (GTI) were used. Because of a severe deteriora- 
tion of the image quality with increasing off-axis angle, search 
for sources was restricted to the circular area of 5 arcmin radius 
centered on chips 0-3. 

More than 300 individual pointings have been qualified for 
the analysis. Their spatial distribution is highly nonrandom. 
Concentrations in various angular scales, as well as the strong 
asymmetry between galactic hemispheres are present. Since a 
large number of pointings covered overlapping areas of the sky, 
the data have been divided into two samples. The detached point- 
ings, which cover separate areas, constitute a principal sample 
used in the investigation. The second auxiliary sample consist- 
ing of overlapping pointings was used to assess uncertainties of 
our statistical analysis. 

The sky distribution of the non-overlapping pointings is pre- 
sented in the polar equal area projection in Figs. [T]and|2]for the 
north and south galactic hemispheres, NGH and SGH, respec- 
tively. Symbols in figures strongly exceed in size the actual ex- 
tent of the individual observation. Numerals close to some dots 
denote numbers of pointings concentrated in the marked areas. 
In the NGH 106 pointings have been selected for furtherprocess- 
ing, while in the SGH - 49. Full dots represent pointings charac- 
terized by high statistical weights (80 in NGH and 41 in SGH); 
open circles - observations that suffer from the largest statisti- 
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cal uncertainties. A detailed discussion of statistical properties 
of the selected material is presented in Sect. |4] 

The data in a "canonical" soft energy band of 0.5 - 2keV 
have been used. Because of various reasons, both the expo- 
sure time and sensitivity are highly variable function of po- 
sition within the field of view. To reduce effects of sensitiv- 
ity variations, low sensitivity areas have been delineated using 
the a Chandra effective exposure mapQ- For each observation, 
the maximum amplitude of the exposure map was determined. 
Pixels with the exposure map below 75 % of this maximum value 
have not been used. Although this constraint reduces the origi- 
nal area of 5arcmin circle by roughly 10%, it s ecures for ou r 
purpose an acceptable uniformity of the exposure (ISoltanll201 Ih . 



3. 'Bright' and 'faint' source counts 

Two independent methods have been applied to estimate the 
number of discrete point-like sources in a single observation. 
The first one was based on a standard procedure of finding 
prominent concentrations of counts within a detection cell. 
Sources isolated in this way are labele d here as 'bright'. A sec- 
ond method, described in ISoltanl(l2011h . estimates number of the 
'faint' sources. It calibrates deviations of the photon distribution 
from the Poissonian one using the nearest neighbor statistics. It 
is not able to isolate individual sources, but provides statistical 
information on the population of sources generating small num- 
ber of photons. 

To count 'bright' sources in a single observation, a circular 
cell was slided over the entire investigated area. The cell size 
was set to enclose 85 % of counts generated by a point source. 
Radius of the detection cell varied as a function of the off axis 
angle according to the shape of th e Chandra p oint spread func- 
tion (PSF). The PSF data given bv lJerius et alj (2000) have been 
used. The search for 'bright' sources has been performed in the 
same manner for all the pointings. The iterative procedure was 
applied. Using the sliding box technique, the strongest concen- 
tration of photons was localized and it was recognized as the 
brightest source in the field of view. Then, the area surround- 
ing this source was extracted from the data and the search for 
the second brightest source was performed in the same way. 
The procedure was repeated for the consecutive sources until the 
number of counts selected as the next source dropped below the 
threshold adopted individually for each observation. The thresh- 
old counts were selected at the level sufficiently high to prevent 
random fluctuation of counts to be approved as a real source. It 
was assumed in this step that the background counts were dis- 
tributed randomly over the field of view. The search for sources 
has been terminated when the probability of random accumula- 
tion of counts exceed 10"^. 

Our data sets span a wide range of exposure times, from 
30 ks through more than 170ks. Because of that the pointings 
differ in the source detection threshold and in the number of de- 
tected sources. To use the observed number of sources as the 
estimator of the source counts per unit solid angle, which is 
independent of the exposure time, one needs to adopt an ana- 
lytic model for the counts and to relate the observed number of 
sources in the individual field to the parameter(s) of the model. 



An exposure map is generated in a standard data processing. It is a 
position dependent product of the effective area of the mirror/detector 
combination and the effective exposure time [in cm^s]. 



A power law is the obvious choice for the source number 
counts per unit solid angle (hereafter per 1 sq. deg): 



dNjS) 
dS 



(1) 



with two a priori free parameters, normalization amplitude No 
and the slope a. However, the limited number of sources as well 
as a narrow dynamic range of fluxes populated by sources pro- 
hibits us from attempts to measure the slope a. Also the data 
on the source counts available in the literature for a couple of 
deep fields do not provide information on the potential field to 
field slope variations. In those few cases where the dynamic 
range within a single field allows to track changes of the slope, 
it appears that the slope is constant o ver the whole range cov - 
ered by the present investigation (e.g. iGeorgakakis et al.ll2008h . 
Therefore, in the present investigation only the normahzation is 
fitted for each field separately, while a single, universal value of 
the slope is assumed. 

A several step procedure is applied to find the best estimate 
of No using the observed number sources. Because the each en- 
hancements of counts within the detection cell is considered 
a 'source', one should carefully evaluate a contribution of the 
background counts and apply the adequate statistical correction. 
Let nsik) is the number of detected 'sources' containing k counts, 
where k = + is a sum of the genuine X-ray photons emit- 
ted by a source k^, and the background counts k^. No is related 
to ns{k) in the following way: 



n,{k)^A)^ 



Pbkgfe) J dS^^P(k,\S) 



(2) 



where A is the solid angle of the observation (in sq. deg), Pbkg(^b) 
denotes the probability that k^ background counts would be 
found in the detection cell, and P(A:s 1 5 ) is the probability that 
a source of flux S generates k^ photons (see Eq.Hl). The proba- 
bility of finding k^ counts within the detection cell is given by 
Poissonian distribution: 



Pbkg(^b) = 



(3) 



where A is the expected number of random counts within the 
detection cell. 

The sum over the background counts in Eq. |2] generally 
includes the k^^ -Q component. However, the minimum value of 
k was set to warrant Pbkg(^b - k) < 10"^, and the last component 
of the sum is effectively negligible. It is convenient to use the 
instrumental count in each observation as a flux unit. The flux s 
in ACIS counts is related to S in physical units by a conversion 
factor 77: s - S/rj, with 77 in erg s"' cm"^ count"' is obtained 
from the exposure map. The probability P{ks \ S ) now takes the 
form: 



P(k, I s) 



k. 



(4) 



Substituting into Eq. |2]the source number counts in new units 



dn(s) 
ds 

where 



«„ - AN, 



or] 



(5) 



(6) 
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and assuming the constant slope a over a wide flux range, we 
finally get: 



k-\ 



ib=0 



Pbkg(^b) 



T{k, + a + \) 

nh + 1) 



(7) 



Equation I2] summed over k gives the expected total number 
of 'bright' sources. To evaluate n,,, the number of actually de- 
tected 'bright' sources has been substituted in the left-hand side 
of Eq.|2] The amplitude A^,, was obtained using Eq.|6l The statis- 
tical properties of the No distribution in our sample are discussed 
in the next section in conjunction with the analogous distribution 
obtained for 'faint' sources. 

The population of sources below the formal source detection 
threshold, labeled as 'faint', is investigated using a statistical ap- 
proach. Here only the basics of the method are presented. All 
the details and potentia l applications were discussed in a sepa- 
rate paper (ISoltanll2011h . 

After the removal of all the 'bright' sources, the remain- 
ing counts are a mix of the non-X-ray events, truly diffuse X- 
ray background and counts generated by a population of faint 
sources. Only this last constituent is responsible for fluctuations 
of the count distribution characteristic for point sources. One can 
express the total number of events, n,, within the investigated 
area in the following form: 



n\ + 112 + ■■■ + nu 



(8) 



where iii represents counts distributed randomly, - counts pro- 
duced by sources contributing exactly 2 counts each ('2-photon 
sources'), n-i - counts by '3-photon sources' and so on up to nk„.^^ 
- counts due to the brightest sources left in the field (i.e. the 
brightest among the 'faint' ones). Counts distributed randomly 
constitute a composite collection of events that includes also the 
weakest discrete sources contributing single photons. Photons 
coming from sources producing 2 < k < fc^ax counts create lo- 
cal enhancement of a unique shape determined by the telescope 
PSF. Statistical characteristics of these variations are efficiently 
quantified using the nearest neighbor statistics (NNST) of the 
count distribution. The probability that randomly chosen event 
has no neighbors within a distance r. 



Pir)^ piP{r\l) + p2P{r\2) + ... + Pk^.^P{r\k^,,) , 



(9) 



where is a probability that this event is not related to any 
source generating 2 or more counts, pk for k - 2, Aimax is the 
probability that this event is produced by the A:-photon source, 
and P{r | k) is the conditional probability that there are no other 
counts within r provided the selected event belongs to the k- 
photon source. Under the reasonable assumptions, for k >2: 



P(r\k)^ P(r\l)-r(r\k), 



(10) 



where P(r | k) is the probability that within r there are no other 
counts from the same source. This quantity is uniquely defined 
by the PSF. Taking Uk - k - Usik) and pk - rtk/n, and combining 
Eqs.miiniEl and0we get: 



T{k + a + \) 



{\-r{r\k)] = P{r\\)-P(r).{n) 



Equation [TT] allows us to determine n,, provided the relevant 
probability distributions are known. One can estimate the proba- 
bility Pif) for each observation by measuring the distance to the 
nearest neighbor for each photon in the field. Similarly, P(r 1 1) is 



given by the distribution of distances to the nearest photon from 
the randomly distributed points. The PSF related probability dis- 
tribution 'P{r\k) was investigated in detail by Soltan (2011). It 
has been determined as a function of the off axis angle using 
several Chandra observations of bright point sources. 

The best estimate of was determined in a following way. 
All the probability distributions have been calculated for several 
(usually more than 20) distances r, which uniformly covered a 
domain of distributions P{r \ 1) and P{r). Then, the cumulative 
probability distributions in Eq. [TT] were replaced by the corre- 
sponding probability densities, e.g. AP(r) = P{r) - P(r + Ar) 
for the consecutive distances r. In all the calculations Ar = 
0.246 arcsec (= 1 /2 of the instrumental pixel) has been used. 
Thus, a set of independent equations covering all the observed 
separations has been constructed, and the best fit value of ng was 
determined using the least square method. Finally, the count am- 
plitude No was obtained using Eq.|6]as for the 'blight' sources. 

4. Counts correlations 

The amplitudes of the number source counts determined for 
'bright' and 'faint' sources (hereafter denoted A^o[^] andA^o[/], 
respectively) define the number source counts over adjoining, 
but different flux levels. Therefore, the median flux of 'bright' 
and 'faint' sources in each observation are not widely separated 
and No[b] and No[f] should be similar. Nevertheless, one can ex- 
pect systematic diff'erences between both estimates, if the count 
slope adapted in the calculations, a, did not match the slope ac- 
tually observed in the data. The differential slope between 10"'^ 
and 10 '"^ergs 'cm^^ i n the deep Cha ndra field s is well fitte d 
with the slope of -1.58 jGeorpakakis et al.i,2008l:[Soltanll20I Ih . 
Using a = -1.58 in the present calculations, the average am- 
plitude of the 'faint' source counts is larger by ~ 5 % then that 
for the 'bright' sources. Both amplitudes are practically identi- 
cal for a = 1 .60. To preserve internal consistency of the inves- 
tigation, the latter slope has been used in the subsequent cal- 
culations. Although this is steeper by 0.02 than the best fit by 
iGeorgakakis et al.[ (12008 '). it stays within theirs Icr limits. One 
should note that the present investigation is not suitable to as- 
sess the count slope either within the individual field or the aver- 
age over the sky. This is because most systematic errors involved 
in the estimates of the number of 'bright' sources are indepen- 
dent from the eiTors affecting the 'faint' sources assessment. The 
source counts amplitudes, No[b] and No[f] are functionally de- 
pendent with the slope a. Therefore, a unique slope is required 
to balance both amplitudes. In fact, modification of the slope by 
just 0.02 demonstrates that the systematics do not play a sig- 
nificant role in the analysis. One should stress also that all the 
conclusions of the paper are not affected by a selection of the 
particular value of the slope. 

Apart from the systematic errors, the No[b] and No[f] am- 
plitudes are subject to large statistical uncertainties that generate 
substantial scatter. The average number of 'bright' sources per 
field Hs - 23.6. Thus, the Poissonian scatter alone introduces 
~ 21 % rms fluctuations to the present estimate of No[b]. In re- 
ality, still larger variations of No[b] are expected due to source 
clustering. The A^,, estimates for the 'faint' sources are afflicted 
by even larger errors, because the estimate of the count ampli- 
tudes is derived indirectly. Two factors contribute most to the fi- 
nal uncertainties. First, the amplitude estimate is proportional to 
the difference between two probability distributions (see Eq.lTTt. 
and both distributions are subject to substantial statistical eiTors. 
Second, the 'faint' sources contribution to the total events is 
small, typically 2 - 6 %, while all the remaining counts are dis- 
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Fig. 3. Amplitudes No of the source number counts for flux S in 
10"'^ ergs^'cm"^ (see Eq.[T]) for the 'bright' and 'faint' sources 
in the sample of 155 pointings; 121 pointings with the combined 
rms uncertainty of No smaller than 134 per sq. deg are shown 
with dots, 34 pointings with larger rms - with crosses. 



tributed randomly. Therefore the net signal, represented by the 
right-hand side in Eq. [TT] is also very modest in comparison to 
the noise that is dominated by the random scatter of the P(r \ 1) 
distribution. 

In Fig.[3]the 'bright' and 'faint' source ampUtudes A^,, as de- 
fined in Eq. [1] where S is in 10 ergs"' cm~^, are plotted for 
the full sample of the non-overlapping 155 pointings. The rms 
scatter of No[b] is in fact comparable to that expected from the 
Poissonian fluctuations of the 'bright' source numbers, while the 
No[f] variance is substantially larger. The rms for 'faint' sources 
amounts to 34 % of the average amplitude of No [/] . Despite the 
scatter, 'bright' and 'faint' amplitudes are strongly correlated. 
The correlation coefficient for 155 data points p = 0.2968. The 
non-directional probability that the data are drawn from the un- 
corrected general population amounts to 0.00018 (or 0.000089 
for the directional case). To minimize statistical noise, the un- 
certainties of the individual No estimates are carefully examined 
(see below). Due to a wide range of exposure times in the pro- 
cessed observations, these uncertainties strongly differ between 
each other. In order to check the influence of the poor quality es- 
timates on the subsequent statistical analysis, pointings with the 
largest uncertainties have been identified. In Fig. |3]these point- 
ings are shown with crosses. 

Although the No[f] estimates are strongly affected by the 
statistical noise at the level of the photon distribution within the 
each pointing, significant correlation between No[f] and No[b] 
in the whole sample favors the calculations of a single amplitude 
No derived simultaneously from the 'bright' and 'faint' sources. 
In order to combine the No[b] and No[f] estimates into a single 
'best' estimate of the source number counts amplitude for each 
pointing, an adequate error estimate and data weighting for each 
measurement are essential. The errors for the 'bright' source 
counts have been calculated using the Poissonian noise, since 



this effect is dominating. In the first step it was assumed that the 
underlying population of sources is the same for all the point- 
ings. Next, the expected number of the observed 'bright' sources 
has been determined for each pointing separately taking into 
account the exposure time. Finally, the Poissonian distribution 
around the expected number of sources was used to compute the 
corresponding dispersion cro(No[b]) individually for each point- 
ing. 

The No[f] uncertainties result from a complex combination 
of the Poisson-like fluctuations of the number of faint sources, 
contribution of background counts, and a stochastic character 
of the photon distribution at angular scales comparable to the 
width of the PSF In eff'ect, the variance of No[f] cannot be eas- 
ily derived from theoretical considerations, but instead it is esti- 
mated directly from the observed distribution. No clear correla- 
tion of the variance with the exposure time (or the total number 
of events) has been found. It was evident that the cosmic variance 
contributed marginally to the observed No[f] scatter. Therefore, 
equal uncertainties have been assign to all the observations. As 
an error of the individual No[f] estimate, o-a(No[f]), the rms 
scatter in the whole sample had been assumed: cr(No[f]) = 242 
per sq. deg. 

Our final best estimate of No is a weighted mean of No[b] 
and No[f] with weights inversely proportional to the squares of 
the rms uncertainties of both estimates. The resulting uncertain- 
ties of No, cro{No), are spread between 101 and 149 per sq. deg. 
To reduce an influence of the low statistical quality data for fur- 
ther analysis, observations with the highest uncertainties have 
been removed from the sample. A threshold of o-o{No) = 134 
per sq. deg has been set as a compromise between the acceptable 
uncertainties and the size of the sample. The resultant sample of 
121 observations is shown with dots in Fig. [3] and the remain- 
ing 34 pointings with o-o(No) > 134 per sq. deg - with crosses. 
A removal of 34 observations that suffer from the largest uncer- 
tainties substantially increases the correlation between A^o[i>] and 
No[f]- For 121 data points p = 0.3589. The corresponding non- 
directional probability for the uncotTelated general population is 
5 • 10"^. Unless otherwise stated, the discussion below refers to 
the sample of 121 observations 

The average number counts amplitude N„ = 705 per sq. deg. 
with the rms scatter cr(No) - 145 per sq. deg. The flux limits 
that define 'bright' and 'faint' sources depend on the exposure 
time, but the flux separating both source classes is always close 
to 10"'^ erg s"' cm"-. Therefore the present A^,, estimate charac- 
terizes the source counts just around this flux level. 

To compare the present result with the source number counts 
based on several deep Chandra fields we adopt the count 
parametrization by iGeorgakakis et al.l (l2008l) . Their data con- 
verted to the units used here give dN/dS — 574 per sq. deg at 
S - 10"'^ ergs"' cm"^, i.e. our figure is 23 % higher. Although 
this difference partially could be attributed to various system- 
atic effects inherent to the present calculations, one should note 
that our amplitude is the average of a large number of pointings 
distributed over the whole sky, while the Chandra Deep Fields 
cover few selected areas. Applying the present method to the se- 
lection of 54 observations in t he CD FS gave A^,, = 601 + 88, 
per sq. deg, what exceeds the IGeorgakakis et al.l (120081) result 
merely by 5 %. The quoted uncertainty represents the rms scatter 
among the 54 observations. Although all these data are localized 
at CDFS, the fields cover slightly different areas. Therefore vari- 
ations of No stand not just for the statistical noise generated by 
our procedures, but include also actual fluctuations in the num- 
ber of sources 
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The rms scatter of the cr^'s was taken as the uncertainty of the 
actually observed signal. 

The subsample of overlapping observations is dominated by 
few groups centered on Chandra Deep Fields. Some widely 
separated bins which happen to include the separation between 
these fields contain pairs predominantly drawn from the selected 
areas. To eliminate this bias, the for the separations larger 
than lOarcmin have been determined using the sample of the 
non-overlapping observations alone. 

A substantial decline of the pair rms for overlapping point- 
ings clearly indicates diminishing contribution of the cosmic 
variance. It is legitimate to assume that at the smallest sepa- 
rations the amplitude of cTp - 65.8 + 9.6 per sq. deg repre- 
sents the scatter of No due to imperfections of the applied pro- 
cedures and is independent from the varying cosmic signal. 
Since the observed rms of the No distribution in the sample of 
121 non-overlapping observations amounts to cr^, = 144.8per 
sq. deg, the intrinsic variance of the counts amplitude crcv = 
129.0+^-^ per sq. deg, or (18.3+;}^)% of the average A^^ ampli- 
tude. Apparently the Poissonian fluctuations provide a dominant 
contribution to the observed scatter. 



Fig. 4. The distribution of the 'pair rms', i.e. rms difference be- 
tween the counts amplitudes in pairs of observations as a func- 
tion of the pair separation (see text for detailed explanation). 
The dotted line separates the overlapping pointings from the de- 
tached ones. The lowest separation bin contains pairs closer than 

0. 1 arcmin. 

To assess the uncertainties genuine to the method, and clean 
from the contribution generated by the cosmic variance, all 264 
observations (including overlapping) have been used in the fol- 
lowing way. For all the possible pointing pairs a difference be- 
tween amplitudes No was obtained: A^ = No{i) - Noij), for 

1, j - 1, 264, and / j. The pairs were divided in several 
bins according to the pair angular separation Then, for the each 
separation bin the rms of A, y has been calculated: 

where N/j denotes the number of pointing pairs in the given sep- 
aration bin, and '2' in the denominator is introduced to normal- 
ize CTp to the rms of No- For perfectly overlapping pointings, the 
'pair rms' o-p represents the scatter intrinsic to the method, while 
for widely separated fields, o-p is a quadratic sum of the intrin- 
sic scatter and the cosmic variance. If the observations partially 
overlap, intermediate amplitudes of o-p are expected. Potentially, 
for close, but non-overlapping fields a coiTelation signal gener- 
ated by the large scale structures could also be present. 

In Fig. ID the distribution of o-p for several bins is shown. 
The abscissa positions give the average pair separation in the 
bin. The lowest separation bin accommodates pairs for which 
the field centers are separated by less than 0.1 arcmin. The verti- 
cal dotted line separates the overlapping fields from the detached 
ones. One sigma error bars were calculated using the bootstrap 
Monte Carlo method. A large number (10000) of simulated sets 
of observations were generated. The original sky coordinates 
have been used and a random amplitude No was drawn from 
the the full assembly of 121 No estimates. Then, using Eq. [T2l 
the pair rms o-p have been calculated for each simulated data set. 



5. Counts in the north and south galactic 
hemispheres 

No obvious variations of the pair rms is present for the non- 
overlapping pointings averaged over the whole data. It indicates 
that any potential strong fluctuations of the counts amplitude 
have angular scales that typically do not exceed a size of the 
individual pointing. Nevertheless, some differences in statisti- 
cal properties of the No distribution between both galactic hemi- 
spheres seem to be present. 

Unfortunately, the observations useful for the present analy- 
sis are not distributed evenly in both hemispheres. Of 121 point- 
ings in the basic sample, 80 is found in the northern hemisphere, 
and 41 in the southern. The average amplitude and the rms scat- 
ter of the number counts in the north ern and southern hemi- 
spheres are A^„(N) = 731.6 + 135.3 and NoiS) = 654.7 + 150.5. 
Assuming that the uncertainty of the average scales as one over 
square root of the number of observations, we have cr[A^o(N)] = 
15.1 and cr[A^o(S)] - 23.5. Then, the uncertainty of the differ- 
ence between both averages equals to 27.9 and the signal-to- 
noise of the difference reaches 2.75. The K-S two sample test ap- 
plied to the both No distributions indicates that the northern and 
southern data sets differ at the confidence level of 0.95. Thus, 
global properties of both hemispheres reveal some differences, 
but at rather low significance. 

Further details of the No distribution are revealed using the 
autocoiTelation functions (ACF). These functions have been cal- 
culated in both hemispheres separately over a wide range of an- 
gular distances, and the results are shown in Fig. |5] Data for the 
northern hemisphere (shown with crosses) are consistent with no 
coiTelation signal for the non-overlapping fields, i.e. at angular 
scales above lOarcmin. The ACF uncertainties have been as- 
sessed using the Monte Carlo method in the same way as for the 
o-p. The ACF in the observed part of the SGH (shown with dots 
in Fig. |5]l exhibits distinctly different characteristics. The cor- 
relations apparent here indicate substantial large angular scale 
variations of the count amplitude. The positive ACF amplitude 
seems to extend up to ~ 30° and it is conspicuously inconsis- 
tent with the ACF in the northern cap (the negative ACF signal 
at larger separations arises in a natural way because the size of 
whole investigated SGH region is just few times larger than the 
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Fig. 5. The autocorrelation function for the No distribution in 
the northern (crosses) and southern (dots) hemisphere. The er- 
ror bars show 1 cr uncertainties. 



apparent correlation length). The Icr uncertainties in the SGH 
data are larger than in the NGH due to smaller number of obser- 
vations and larger rms scatter of the No amplitudes in the former 
hemisphere. 



6. Discussion 

Variations of the source number counts have been analyzed us- 
ing the Chandra observations distributed in both galactic hemi- 
spheres. The counts amplitude was obtained using two indepen- 
dent and complementary methods. The dominating contribution 
in the observed scatter comes from the Poissonian noise induced 
by the relatively small average number of sources per field. This 
effect is seriously confusing a search for the potential coiTela- 
tions in the A^,, distribution. Scarce observational data, particu- 
larly in the south galactic hemisphere, prevent us from draw- 
ing conclusions at high confidence level. The pointings are dis- 
tributed highly non homogeneously in the celestial sphere in 
the large as well as in small angular scales. Due to these de- 
ficiencies, the picture emerging from the present investigation 
is to some extent ambiguous. Fluctuations of the counts ampli- 
tude averaged over the whole sample of 121 observations do 
not exhibit any irregularities which could be symptomatic for 
the existence of the large angular scale structure of the XRB. 
However, statistical properties of the data split into the northern 
and southern hemispheres reveal noticeable differences between 
the source distributions. In the northern hemisphere the ACF am- 
plitude does not exceed a few thousandths and is within errors 
consistent wi th zero. This r esult is in agreement with our inves- 
tigation (Solt an et al.ll996h based on the /;05Ar All-Sky Survey 
(RASS). We analyzed a section of the NGH above b - 40° with 
70° < / < 250° in the energy band 0.73 - 1.56keV. The RASS 
ACF amplitude in that region drops to ~ 2 ■ 10"^ at separations 
61 ~ 1° and below 5 ■ 10"^ at > 5°. 



A distinctly different picture emerges from the present in- 
vestigation in the SGH. Here at separations below ~ 30° the 
ACF amplitude rises to ~ 0.02 and differs from zero with sig- 
nificance above 3cr. Close inspection of the A^,, distribution in 
both hemispheres indicates that the ACF amplitude in the SGH 
is generated by a group of pointings with lower than average A^,, 
amplitudes in the longitude quadrant 240° < / < 330°. It is clear 
that the ACF signal determined in the SGH is discordant with 
the ACF in the NGH using the present data as well as with the 
ACF of the RASS determined again from a section of the NGH. 
The eff'ect undoubtedly deserves further study. One should note, 
that the apparent differences between hemispheres cannot be ex- 
plained by inhomogeneities of the soft X-ray absoiption by cold 
gas within the Galaxy. To generate a 12% difference in the num- 
ber of sources in both hemispheres, hydrogen column densities 
should differ by almost 10^' cm"^, what is excluded by the ob- 
servational data0 

Although all the pointings were carefully examined, the 
correlations in the No distribution could be a result of a con- 
spiracy of the instrumental effects or observational selection. 
Nevertheless, a genuine cosmic signal should not be ruled out. 
The average counts amplitude of 14 pointings in the SGH be- 
tween longitudes 240° and 330° is lower by 26% than the av- 
erage amplitude of all the remaining 107 pointings. Such de- 
cline of the source counts could be realized by a single void or 
by a radial alignment of several voids. In the case of the single 
void, one can estimate its line of sigh t extent usi ng the model of 
the source redshift distributions by So ltanI (12008!) . The minimum 
size of 1200Mpc in comoving coordinates is obtained with the 
void center at redshift of ~ 0.8. This size is comparable to the 
longest dimension of the quasar chain found by Clow es et al.l 
(1201 2|) in the SDSS. Transverse extents of the supposed void are 
rather weakly constrain due to poor sampling. Assuming that the 
subtended angle of the void corresponds to the correlation length 
of the ACF, the transverse size is of the same order as the radial 
one. Thus, the volume of the void w ould be much larger than the 
volume of the structure repotted by IClowes et al 1 (120121) . 

A large number of the new pointings would improve statis- 
tical significance of the investigations. Unfortunately, one can- 
not expect a rapid inflow of the new Chandra observations. 
Therefore, a more promising way to proceed would be to search 
for the correlation between the existing Chandra observations 
and the data in other available X-ray surveys. In particular, the 
relationship between the counts amplitude determined here and 
the RASS SGH maps should be investigated. 
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